Explore, edit and leverage genomic annotations using Python GTF toolkit.
Internal identifier: 000539 (Main/Exploration); previous: 000538; next: 000540
Authors: F. Lopez [France]; G. Charbonnier [France]; Y. Kermezli [France, Algeria]; M. Belhocine [United Arab Emirates]; Q. Ferré [France]; N. Zweig; M. Aribi [Algeria]; A. Gonzalez [France]; S. Spicuglia [France]; D. Puthier [France]
Source:
- Bioinformatics (Oxford, England) [ 1367-4811 ] ; 2019.
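French descriptors
- KwdFr: Biologie informatique (MeSH); Génomique (MeSH); Logiciel (MeSH)
- MESH: Biologie informatique; Génomique; Logiciel
English descriptors
- KwdEn: Computational Biology (MeSH); Genomics (MeSH); Software (MeSH)
- MESH: Computational Biology; Genomics; Software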
Abstract
MOTIVATION
While Python has become very popular in bioinformatics, a limited number of libraries exist for fast manipulation of gene coordinates in Ensembl GTF format.
RESULTS
We have developed the GTF toolkit Python package (pygtftk), which aims at providing easy and powerful manipulation of gene coordinates in GTF format. For optimal performance, the core engine of pygtftk is a C dynamic library (libgtftk) while the Python API provides usability and readability for developing scripts. Based on this Python package, we have developed the gtftk command line interface that contains 57 sub-commands (v0.9.10) to ease handling of GTF files. These commands may be used to (i) perform basic tasks (e.g. selections, insertions, updates or deletions of features/keys), (ii) select genes/transcripts based on various criteria (e.g. size, exon number, transcription start site location, intron length, GO terms) or (iii) carry out more advanced operations such as coverage analyses of genomic features using bigWig files to create faceted read-coverage diagrams. In conclusion, the pygtftk package greatly simplifies the annotation of GTF files with external information while providing advanced tools to perform gene analyses.
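To make the "basic tasks" mentioned above more concrete, here is a minimal plain-Python sketch of selecting GTF records by feature type or attribute key. It does not use pygtftk or reproduce its API; the helper names `parse_gtf_line` and `select_by_key` and the two sample records are hypothetical and serve only as an illustration of the 9-column GTF layout.

```python
import re

# Two hypothetical GTF records (tab-separated, 9 columns), for illustration only.
SAMPLE_GTF = (
    'chr1\tdemo\tgene\t100\t500\t.\t+\t.\tgene_id "G1"; gene_biotype "protein_coding";\n'
    'chr1\tdemo\texon\t100\t200\t.\t+\t.\tgene_id "G1"; transcript_id "T1"; exon_number "1";\n'
)

# Matches attribute pairs of the form: key "value"
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')


def parse_gtf_line(line):
    """Split one GTF line into a dict of fixed columns plus its attributes."""
    cols = line.rstrip("\n").split("\t")
    record = {
        "seqid": cols[0],
        "source": cols[1],
        "feature": cols[2],
        "start": int(cols[3]),  # GTF coordinates are 1-based, inclusive
        "end": int(cols[4]),
        "strand": cols[6],
    }
    # The 9th column holds the key/value attributes.
    record.update(dict(ATTR_RE.findall(cols[8])))
    return record


def select_by_key(records, key, value):
    """Keep the records whose column or attribute `key` equals `value`."""
    return [r for r in records if r.get(key) == value]


records = [parse_gtf_line(line) for line in SAMPLE_GTF.splitlines()]
exons = select_by_key(records, "feature", "exon")
exon_sizes = [r["end"] - r["start"] + 1 for r in exons]
```

A pure-Python loop like this is easy to read but slow on full annotation files, which is presumably why the package described here delegates the core work to a C library.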
AVAILABILITY AND IMPLEMENTATION
pygtftk and gtftk have been tested on Linux and MacOSX and are available from https://github.com/dputhier/pygtftk under the MIT license. The libgtftk dynamic library written in C is available from https://github.com/dputhier/libgtftk.
DOI: 10.1093/bioinformatics/btz116
PubMed: 30768152
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Explore, edit and leverage genomic annotations using Python GTF toolkit.</title>
<author><name sortKey="Lopez, F" sort="Lopez, F" uniqKey="Lopez F" first="F" last="Lopez">F. Lopez</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Charbonnier, G" sort="Charbonnier, G" uniqKey="Charbonnier G" first="G" last="Charbonnier">G. Charbonnier</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Kermezli, Y" sort="Kermezli, Y" uniqKey="Kermezli Y" first="Y" last="Kermezli">Y. Kermezli</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
<country xml:lang="fr">Algérie</country>
<wicri:regionArea>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University</wicri:regionArea>
<wicri:noRegion>Tlemcen University</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Belhocine, M" sort="Belhocine, M" uniqKey="Belhocine M" first="M" last="Belhocine">M. Belhocine</name>
<affiliation wicri:level="1"><nlm:affiliation>Molecular Biology and Genetics Laboratory, Dubai, United Arab Emirates.</nlm:affiliation>
<country xml:lang="fr">Émirats arabes unis</country>
<wicri:regionArea>Molecular Biology and Genetics Laboratory, Dubai</wicri:regionArea>
<wicri:noRegion>Dubai</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Ferre, Q" sort="Ferre, Q" uniqKey="Ferre Q" first="Q" last="Ferré">Q. Ferré</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Zweig, N" sort="Zweig, N" uniqKey="Zweig N" first="N" last="Zweig">N. Zweig</name>
<affiliation><nlm:affiliation>Aix Marseille Univ.</nlm:affiliation>
<wicri:noCountry code="no comma">Aix Marseille Univ.</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Aribi, M" sort="Aribi, M" uniqKey="Aribi M" first="M" last="Aribi">M. Aribi</name>
<affiliation wicri:level="1"><nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
<country xml:lang="fr">Algérie</country>
<wicri:regionArea>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University</wicri:regionArea>
<wicri:noRegion>Tlemcen University</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Gonzalez, A" sort="Gonzalez, A" uniqKey="Gonzalez A" first="A" last="Gonzalez">A. Gonzalez</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Spicuglia, S" sort="Spicuglia, S" uniqKey="Spicuglia S" first="S" last="Spicuglia">S. Spicuglia</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
<affiliation><nlm:affiliation>Equipe Labellisée LIGUE contre le Cancer.</nlm:affiliation>
<wicri:noCountry code="no comma">Equipe Labellisée LIGUE contre le Cancer.</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Puthier, D" sort="Puthier, D" uniqKey="Puthier D" first="D" last="Puthier">D. Puthier</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">PubMed</idno>
<date when="2019">2019</date>
<idno type="RBID">pubmed:30768152</idno>
<idno type="pmid">30768152</idno>
<idno type="doi">10.1093/bioinformatics/btz116</idno>
<idno type="wicri:Area/PubMed/Corpus">000279</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">000279</idno>
<idno type="wicri:Area/PubMed/Curation">000278</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">000278</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000260</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">000260</idno>
<idno type="wicri:Area/Main/Merge">000539</idno>
<idno type="wicri:Area/Main/Curation">000539</idno>
<idno type="wicri:Area/Main/Exploration">000539</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Explore, edit and leverage genomic annotations using Python GTF toolkit.</title>
<author><name sortKey="Lopez, F" sort="Lopez, F" uniqKey="Lopez F" first="F" last="Lopez">F. Lopez</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Charbonnier, G" sort="Charbonnier, G" uniqKey="Charbonnier G" first="G" last="Charbonnier">G. Charbonnier</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Kermezli, Y" sort="Kermezli, Y" uniqKey="Kermezli Y" first="Y" last="Kermezli">Y. Kermezli</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1"><nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
<country xml:lang="fr">Algérie</country>
<wicri:regionArea>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University</wicri:regionArea>
<wicri:noRegion>Tlemcen University</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Belhocine, M" sort="Belhocine, M" uniqKey="Belhocine M" first="M" last="Belhocine">M. Belhocine</name>
<affiliation wicri:level="1"><nlm:affiliation>Molecular Biology and Genetics Laboratory, Dubai, United Arab Emirates.</nlm:affiliation>
<country xml:lang="fr">Émirats arabes unis</country>
<wicri:regionArea>Molecular Biology and Genetics Laboratory, Dubai</wicri:regionArea>
<wicri:noRegion>Dubai</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Ferre, Q" sort="Ferre, Q" uniqKey="Ferre Q" first="Q" last="Ferré">Q. Ferré</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Zweig, N" sort="Zweig, N" uniqKey="Zweig N" first="N" last="Zweig">N. Zweig</name>
<affiliation><nlm:affiliation>Aix Marseille Univ.</nlm:affiliation>
<wicri:noCountry code="no comma">Aix Marseille Univ.</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Aribi, M" sort="Aribi, M" uniqKey="Aribi M" first="M" last="Aribi">M. Aribi</name>
<affiliation wicri:level="1"><nlm:affiliation>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University, Algeria.</nlm:affiliation>
<country xml:lang="fr">Algérie</country>
<wicri:regionArea>The Laboratory of Applied Molecular Biology and Immunology, Tlemcen University</wicri:regionArea>
<wicri:noRegion>Tlemcen University</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Gonzalez, A" sort="Gonzalez, A" uniqKey="Gonzalez A" first="A" last="Gonzalez">A. Gonzalez</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
<author><name sortKey="Spicuglia, S" sort="Spicuglia, S" uniqKey="Spicuglia S" first="S" last="Spicuglia">S. Spicuglia</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
<affiliation><nlm:affiliation>Equipe Labellisée LIGUE contre le Cancer.</nlm:affiliation>
<wicri:noCountry code="no comma">Equipe Labellisée LIGUE contre le Cancer.</wicri:noCountry>
</affiliation>
</author>
<author><name sortKey="Puthier, D" sort="Puthier, D" uniqKey="Puthier D" first="D" last="Puthier">D. Puthier</name>
<affiliation wicri:level="3"><nlm:affiliation>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille, France.</nlm:affiliation>
<country xml:lang="fr">France</country>
<wicri:regionArea>Aix Marseille Univ, INSERM, TAGC, UMR U1090, Marseille</wicri:regionArea>
<placeName><region type="region">Provence-Alpes-Côte d'Azur</region>
<region type="old region">Provence-Alpes-Côte d'Azur</region>
<settlement type="city">Marseille</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series><title level="j">Bioinformatics (Oxford, England)</title>
<idno type="eISSN">1367-4811</idno>
<imprint><date when="2019" type="published">2019</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Computational Biology (MeSH)</term>
<term>Genomics (MeSH)</term>
<term>Software (MeSH)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr"><term>Biologie informatique (MeSH)</term>
<term>Génomique (MeSH)</term>
<term>Logiciel (MeSH)</term>
</keywords>
<keywords scheme="MESH" xml:lang="en"><term>Computational Biology</term>
<term>Genomics</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr"><term>Biologie informatique</term>
<term>Génomique</term>
<term>Logiciel</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en"><p><b>MOTIVATION</b>
</p>
<p>While Python has become very popular in bioinformatics, a limited number of libraries exist for fast manipulation of gene coordinates in Ensembl GTF format.</p>
</div>
<div type="abstract" xml:lang="en"><p><b>RESULTS</b>
</p>
<p>We have developed the GTF toolkit Python package (pygtftk), which aims at providing easy and powerful manipulation of gene coordinates in GTF format. For optimal performance, the core engine of pygtftk is a C dynamic library (libgtftk) while the Python API provides usability and readability for developing scripts. Based on this Python package, we have developed the gtftk command line interface that contains 57 sub-commands (v0.9.10) to ease handling of GTF files. These commands may be used to (i) perform basic tasks (e.g. selections, insertions, updates or deletions of features/keys), (ii) select genes/transcripts based on various criteria (e.g. size, exon number, transcription start site location, intron length, GO terms) or (iii) carry out more advanced operations such as coverage analyses of genomic features using bigWig files to create faceted read-coverage diagrams. In conclusion, the pygtftk package greatly simplifies the annotation of GTF files with external information while providing advanced tools to perform gene analyses.</p>
</div>
<div type="abstract" xml:lang="en"><p><b>AVAILABILITY AND IMPLEMENTATION</b>
</p>
<p>pygtftk and gtftk have been tested on Linux and MacOSX and are available from https://github.com/dputhier/pygtftk under the MIT license. The libgtftk dynamic library written in C is available from https://github.com/dputhier/libgtftk.</p>
</div>
</front>
</TEI>
<affiliations><list><country><li>Algérie</li>
<li>France</li>
<li>Émirats arabes unis</li>
</country>
<region><li>Provence-Alpes-Côte d'Azur</li>
</region>
<settlement><li>Marseille</li>
</settlement>
</list>
<tree><noCountry><name sortKey="Zweig, N" sort="Zweig, N" uniqKey="Zweig N" first="N" last="Zweig">N. Zweig</name>
</noCountry>
<country name="France"><region name="Provence-Alpes-Côte d'Azur"><name sortKey="Lopez, F" sort="Lopez, F" uniqKey="Lopez F" first="F" last="Lopez">F. Lopez</name>
</region>
<name sortKey="Charbonnier, G" sort="Charbonnier, G" uniqKey="Charbonnier G" first="G" last="Charbonnier">G. Charbonnier</name>
<name sortKey="Ferre, Q" sort="Ferre, Q" uniqKey="Ferre Q" first="Q" last="Ferré">Q. Ferré</name>
<name sortKey="Gonzalez, A" sort="Gonzalez, A" uniqKey="Gonzalez A" first="A" last="Gonzalez">A. Gonzalez</name>
<name sortKey="Kermezli, Y" sort="Kermezli, Y" uniqKey="Kermezli Y" first="Y" last="Kermezli">Y. Kermezli</name>
<name sortKey="Puthier, D" sort="Puthier, D" uniqKey="Puthier D" first="D" last="Puthier">D. Puthier</name>
<name sortKey="Spicuglia, S" sort="Spicuglia, S" uniqKey="Spicuglia S" first="S" last="Spicuglia">S. Spicuglia</name>
</country>
<country name="Algérie"><noRegion><name sortKey="Kermezli, Y" sort="Kermezli, Y" uniqKey="Kermezli Y" first="Y" last="Kermezli">Y. Kermezli</name>
</noRegion>
<name sortKey="Aribi, M" sort="Aribi, M" uniqKey="Aribi M" first="M" last="Aribi">M. Aribi</name>
</country>
<country name="Émirats arabes unis"><noRegion><name sortKey="Belhocine, M" sort="Belhocine, M" uniqKey="Belhocine M" first="M" last="Belhocine">M. Belhocine</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
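As a rough illustration of how the record above can be read programmatically, the following Python sketch extracts the title, authors and identifiers with the standard library. It works on a trimmed, hypothetical fragment that keeps only unprefixed elements: the full record uses `wicri:` and `nlm:` prefixes whose namespace declarations do not appear in this excerpt, so a namespace-aware parser would probably need them declared before it accepts the whole document.

```python
import xml.etree.ElementTree as ET

# Trimmed fragment of the record shown above (assumption: only elements
# without wicri:/nlm: prefixes are kept, and only two authors are included).
FRAGMENT = """
<record>
  <titleStmt>
    <title xml:lang="en">Explore, edit and leverage genomic annotations using Python GTF toolkit.</title>
    <author><name sortKey="Lopez, F" first="F" last="Lopez">F. Lopez</name></author>
    <author><name sortKey="Puthier, D" first="D" last="Puthier">D. Puthier</name></author>
  </titleStmt>
  <publicationStmt>
    <idno type="pmid">30768152</idno>
    <idno type="doi">10.1093/bioinformatics/btz116</idno>
  </publicationStmt>
</record>
"""

root = ET.fromstring(FRAGMENT)

# First <title> element anywhere below the root.
title = root.findtext(".//title")

# Display names of all authors, in document order.
authors = [name.text for name in root.iter("name")]

# Map each identifier type (pmid, doi, ...) to its value.
identifiers = {idno.get("type"): idno.text for idno in root.iter("idno")}

print(title)
print("; ".join(authors))
print(identifiers["doi"])
```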
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Sante/explor/MaghrebDataLibMedV2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000539 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000539 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Sante |area= MaghrebDataLibMedV2 |flux= Main |étape= Exploration |type= RBID |clé= pubmed:30768152 |texte= Explore, edit and leverage genomic annotations using Python GTF toolkit. }}
To generate wiki pages
HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i -Sk "pubmed:30768152" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd \
       | NlmPubMed2Wicri -a MaghrebDataLibMedV2
This area was generated with Dilib version V0.6.38.